Abstract. The Australian SKA Pathfinder (ASKAP) is a next-generation radio telescope
currently under construction in Western Australia. The fast survey speed and wide
field of view make it an ideal instrument for blind transient searches. The ASKAP
Variable and Slow Transients (VAST) survey is one of the major science programs
planned for ASKAP. The scientific goals of VAST include the detection and
characterisation of a wide range of transient and variable phenomena, from gamma-ray
burst afterglows to extreme scattering events, on timescales of 5 seconds or longer.

We describe the data and processing challenges involved in running the VAST
real-time transient detection pipeline. ASKAP will produce 2.5 GB of visibility data per
second, transformed into one 8 GB image cube every 5 seconds. Each cube will contain
approximately twenty 100 megapixel images, with hundreds of radio sources detected in
each epoch. The VAST pipeline will measure and monitor all of these sources, detect
variables and transients, and generate alerts using the VOEvent framework.

The goal of the VAST Design Study is to develop a prototype pipeline to establish 
and demonstrate the functionality of the final ASKAP pipeline. We give an overview 
of the prototype pipeline's functionality, technical implementation and current status. 



1. The Australian SKA Pathfinder (ASKAP) 



The Australian SKA Pathfinder (ASKAP; Johnston et al. 2008) is a powerful radio
telescope currently under construction in Western Australia. It is a 36-dish radio
interferometer with exceptional survey speed: it can image the entire southern sky to
high sensitivity in two nights. The fast survey speed and wide field of view make it an
ideal instrument for blind transient searches.

ASKAP is located at the Murchison Radio-astronomy Observatory (MRO), approximately
315 km north-east of Geraldton, Western Australia. The MRO is also the location of
the Murchison Widefield Array (MWA) telescope and is the candidate site for Australia
and New Zealand's bid to host the Square Kilometre Array (SKA).

A 6-antenna array called BETA is currently being installed and will be commissioned
during 2012. ASKAP is scheduled to begin scientific observations with a 6 to 12
antenna array during 2013.

2. The VAST Survey

ASKAP will give us an unprecedented opportunity to investigate the transient sky at
radio wavelengths. Its wide-field survey capabilities will enable the discovery and
investigation of variable and transient phenomena from the local to the cosmological,
including flare stars, intermittent pulsars, X-ray binaries, magnetars, extreme
scattering events, intra-day variables, radio supernovae and orphan afterglows of
gamma-ray bursts. In addition, ASKAP probes unexplored regions of phase space where
new classes of transient sources may be detected.

The VAST survey (Murphy & Chatterjee 2009) is one of ten major science programs
planned for ASKAP. It will involve several survey regimes, including one that images
most of the visible sky every night for two years. This will produce measurements of
one million sources over hundreds of epochs.



3. Transient Pipeline Functionality 

The VAST transient pipeline will detect transients and variables by analysing the
images produced by the ASKAP imaging pipeline. The VAST pipeline operates by
extracting and measuring sources in images, constructing and analysing light curves,
and generating notifications of detections. This approach is based on techniques
described by Bannister et al. (2011). The functionality of the VAST pipeline is
illustrated in Figure 1 and described in Table 1.

The VAST collaboration is also investigating transient detection using image
subtraction and visibility domain transient detection (Trott et al. 2011).




Figure 1. Illustration of VAST pipeline functionality



4. Capacity Challenges 

The traditional model for radio astronomy is to take the data from the telescope and
process it off-site some time later. ASKAP produces too much data to store and
transport, so the VAST transient survey will be implemented as a near real-time
pipeline running on the same large computing cluster as the ASKAP imaging systems.

Table 1. Description of VAST pipeline functionality

Function                 Description

Source finding           Blind source finding is performed on every image.
                         This produces a list of detected sources with
                         position, flux (brightness) and size.

Source association       Each detected source is associated with one from
                         the master list by finding the master source at
                         the same position as the detection. If no master
                         source is found at that position then a new
                         master source is added.

Image quality control    The image is assessed for quality by comparing the
                         measured parameters of the detected sources to a
                         list of well-characterised sources using a
                         technique described by Bannister et al. (2011).

Source monitoring        Master sources known to be in the field that were
                         not detected by the source finder are measured by
                         attempting to fit a Gaussian at the position where
                         the source is expected to be. If no source can be
                         fit then an upper limit on the flux at that
                         position is taken instead.

Light curve creation     All measurements for each source are collated into
                         a radio light curve: a collection of flux
                         measurements for all epochs where the source was
                         observed.

Detect and classify      The updated light curves are scanned to detect
transients and           transient or variable behaviour. Sources that
variables                exhibit transient or variable behaviour are
                         classified using a machine-learning algorithm.
                         This attempts to determine what kind of physical
                         phenomenon is causing the variability.

Notification             Detections and classifications are subjected to
                         quality control and cross-matching to external
                         catalogues. Those that appear genuine are added to
                         an archive and the community is alerted using
                         VOEvent notifications.
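
The light curve creation and detection steps listed in Table 1 can be sketched as
follows. This is a minimal illustration, not the VAST implementation: the Measurement
record, the build_light_curves and is_variable helpers, and the choice of a reduced
chi-squared test against a constant flux with a threshold of 5 are all assumptions
made for the example. The text does not say which variability statistic the pipeline
uses. Upper limits from the source monitoring step are kept in the light curve but
excluded from the statistic.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Measurement:
    """One flux measurement of a master source in one epoch (hypothetical record)."""
    source_id: int
    epoch: float          # observation time, e.g. seconds since survey start
    flux: float           # measured flux, or the limit value if is_upper_limit
    flux_err: float       # 1-sigma uncertainty on the flux
    is_upper_limit: bool = False


def build_light_curves(measurements):
    """Collate measurements into one time-ordered light curve per source."""
    curves = defaultdict(list)
    for m in measurements:
        curves[m.source_id].append(m)
    for curve in curves.values():
        curve.sort(key=lambda m: m.epoch)
    return dict(curves)


def reduced_chi_squared(curve):
    """Reduced chi-squared of the detections against their weighted mean flux."""
    detections = [m for m in curve if not m.is_upper_limit]
    if len(detections) < 2:
        return 0.0  # too few points to assess variability
    weights = [1.0 / m.flux_err ** 2 for m in detections]
    mean = sum(w * m.flux for w, m in zip(weights, detections)) / sum(weights)
    chi2 = sum(((m.flux - mean) / m.flux_err) ** 2 for m in detections)
    return chi2 / (len(detections) - 1)


def is_variable(curve, threshold=5.0):
    """Flag a light curve as a variable candidate (the threshold is an assumption)."""
    return reduced_chi_squared(curve) > threshold
```

A source whose flux is constant within its errors gives a statistic near 1 or below
and is not flagged, while a source that brightens well beyond its errors is passed on
as a candidate for classification.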

ASKAP's imaging pipeline will produce two streams of images: one with integration
times of 5 seconds, the other with longer integration times depending on the observing
mode (e.g. 1 minute, 1 hour or 8 hours). Each 5 second image cube will contain
approximately twenty 100 megapixel images: ~5 frequency bands, each with 4
polarisations. Each cube will be ~8 GB in size.
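
The quoted cube size can be checked with a short calculation. The sketch below
assumes 4-byte (32-bit floating point) pixels, which the text does not state; under
that assumption the figures above are self-consistent.

```python
# Back-of-envelope check of the image cube size quoted in the text.
FREQUENCY_BANDS = 5
POLARISATIONS = 4
PIXELS_PER_IMAGE = 100e6      # ~100 megapixels per image
BYTES_PER_PIXEL = 4           # assumption: 32-bit floats (not stated in the text)
CUBE_INTERVAL_S = 5           # one cube every 5 seconds

images_per_cube = FREQUENCY_BANDS * POLARISATIONS
cube_bytes = images_per_cube * PIXELS_PER_IMAGE * BYTES_PER_PIXEL
cube_gb = cube_bytes / 1e9
image_rate_gb_per_s = cube_gb / CUBE_INTERVAL_S

print(f"{images_per_cube} images per cube, {cube_gb:.0f} GB per cube, "
      f"{image_rate_gb_per_s:.1f} GB/s of image data")
```

With these inputs the image stream that the transient pipeline must keep up with is
of the same order as the visibility data rate quoted in the abstract.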

Jay Banyer, Tara Murphy, and the VAST Collaboration

In the transient pipeline the 5 second image stream will require approximately
12,000 source measurements per second (approximately 100 sources per square degree,
an image size of 30 square degrees, 20 images per cube, one cube every 5 seconds).
Light curves for approximately 3,000 sources will change each second. Each source
detection requires a cone search to associate the detection with a known source
position. Some 3,000 cone searches will be performed each second; however, most of
these can be optimised by searching a cached list of the known sources in the current
field. The cached list will be refreshed each time the telescope moves to a new field,
which will happen no more frequently than about once a minute.
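
The load estimate and the caching strategy described above can be sketched as
follows. This is an illustration under stated assumptions, not the VAST
implementation: the FieldCache class, the fetch_field_sources callback and the 10
arcsecond match radius are hypothetical. The first lines reproduce the 12,000
measurements per second figure from the numbers in the text. The class then associates
detections against an in-memory list that is reloaded only when the field changes, and
adds a new master source when no match is found.

```python
import math

# Measurement rate implied by the figures in the text.
SOURCES_PER_SQ_DEG = 100
IMAGE_AREA_SQ_DEG = 30
IMAGES_PER_CUBE = 20
CUBE_INTERVAL_S = 5
measurements_per_second = (
    SOURCES_PER_SQ_DEG * IMAGE_AREA_SQ_DEG * IMAGES_PER_CUBE / CUBE_INTERVAL_S
)  # 100 * 30 * 20 / 5 = 12,000


def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees (haversine formula)."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    a = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(math.sqrt(min(1.0, a))))


class FieldCache:
    """In-memory list of master sources for the current field (hypothetical)."""

    def __init__(self, fetch_field_sources, match_radius_deg=10 / 3600):
        self.fetch_field_sources = fetch_field_sources  # e.g. a database cone search
        self.match_radius_deg = match_radius_deg        # assumed match radius
        self.field_id = None
        self.sources = []                               # list of (id, ra, dec)

    def set_field(self, field_id):
        """Reload the cache only when the telescope moves to a new field."""
        if field_id != self.field_id:
            self.sources = list(self.fetch_field_sources(field_id))
            self.field_id = field_id

    def associate(self, ra, dec):
        """Return the id of the nearest cached source within the match radius.

        If none is found, a new master source is added to the cache.
        """
        best_id, best_sep = None, self.match_radius_deg
        for source_id, s_ra, s_dec in self.sources:
            sep = angular_separation_deg(ra, dec, s_ra, s_dec)
            if sep <= best_sep:
                best_id, best_sep = source_id, sep
        if best_id is None:
            # Placeholder id; a real pipeline would let the database assign it.
            best_id = max((s[0] for s in self.sources), default=0) + 1
            self.sources.append((best_id, ra, dec))
        return best_id
```

The linear scan keeps the example short. At the rates quoted above a production
system would more plausibly use a spatial index over the cached list, but the
refresh-on-field-change logic would be the same.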

The image stream with longer integrations (e.g. 1 hour) will produce images at 
a lower rate but each image will contain more sources due to higher sensitivity. The 
source measurement load for this stream will vary depending on the integration time. 

5. VAST Pipeline Prototype 

A prototype transient pipeline has been developed as part of the VAST Design Study.
It allows iterative development of the functional requirements of the final ASKAP
pipeline. The prototype pipeline will also be used to perform transient detection on
data from other telescopes, including the Murchison Widefield Array (MWA), the
Australia Telescope Compact Array (ATCA), the Very Large Array (VLA) and the
SKA Molonglo Prototype (SKAMP).

The prototype takes a series of FITS images as input, detects and measures sources, 
performs image quality control, constructs light curves and outputs results to a database. 
Transient detection, classification and VO alerts are being developed. 

The prototype uses a custom source finder that is able to characterise multiple 
overlapping sources more accurately than other existing source finders (Hancock et al. 
in prep). 

A dynamic website provides a user interface to the database. Its purpose is to help 
visualise and diagnose the pipeline output easily and efficiently. It can browse the lists 
of images, master sources and detections, plot light curves, create postage stamps and 
perform cross-matching with external catalogues. 

The prototype has already proven to be a useful tool for analysing datasets from
radio telescopes, even when not looking for transients, because it can measure and
report on all sources in all images with very little effort. This assists with checking
the image quality, calibration and positional accuracy of the dataset.

5.1. Languages, Tools and Libraries 

The prototype pipeline is developed using the Python programming language and the
Django web framework. These were chosen for their suitability for rapidly changing
requirements during the VAST Design Study and the availability of astronomical
libraries. PostgreSQL is used for the SQL database with the Q3C plugin for optimised
celestial coordinate searches (cone searches).
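
A cone search with Q3C might be issued from Python along the following lines. The
table and column names (master_source, id, ra, dec) are hypothetical, and the helper
only builds a parameterised query rather than showing how the prototype itself talks
to the database. The functions q3c_radial_query, q3c_dist and q3c_ang2ipix are
provided by the Q3C extension and take coordinates and radii in degrees.

```python
# Index that lets Q3C accelerate positional queries (run once per table).
CREATE_INDEX_SQL = (
    "CREATE INDEX master_source_q3c_idx "
    "ON master_source (q3c_ang2ipix(ra, dec));"
)

# Cone search: all master sources within `radius` degrees of (ra, dec),
# nearest first. Table and column names are assumptions for this sketch.
CONE_SEARCH_SQL = (
    "SELECT id, ra, dec, q3c_dist(ra, dec, %(ra)s, %(dec)s) AS sep_deg "
    "FROM master_source "
    "WHERE q3c_radial_query(ra, dec, %(ra)s, %(dec)s, %(radius)s) "
    "ORDER BY sep_deg;"
)


def cone_search_query(ra_deg, dec_deg, radius_deg):
    """Return (sql, params) for a cone search around the given position."""
    if not 0.0 <= ra_deg < 360.0:
        raise ValueError("ra_deg must be in [0, 360)")
    if not -90.0 <= dec_deg <= 90.0:
        raise ValueError("dec_deg must be in [-90, 90]")
    if radius_deg <= 0.0:
        raise ValueError("radius_deg must be positive")
    params = {"ra": ra_deg, "dec": dec_deg, "radius": radius_deg}
    return CONE_SEARCH_SQL, params
```

With a DB-API driver such as psycopg2, the returned pair could be passed directly as
cursor.execute(sql, params); the %(name)s placeholders follow that driver's named
parameter style.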